Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing.

نویسندگان

  • Andreas P M Weber
  • Katrin L Weber
  • Kevin Carr
  • Curtis Wilkerson
  • John B Ohlrogge
چکیده

Massively parallel sequencing of DNA by pyrosequencing technology offers much higher throughput and lower cost than conventional Sanger sequencing. Although extensively used already for sequencing of genomes, relatively few applications of massively parallel pyrosequencing to transcriptome analysis have been reported. To test the ability of this technology to provide unbiased representation of transcripts, we analyzed mRNA from Arabidopsis (Arabidopsis thaliana) seedlings. Two sequencing runs yielded 541,852 expressed sequence tags (ESTs) after quality control. Mapping of the ESTs to the Arabidopsis genome and to The Arabidopsis Information Resource 7.0 cDNA models indicated: (1) massively parallel pyrosequencing detected transcription of 17,449 gene loci providing very deep coverage of the transcriptome. Performing a second sequencing run only increased the number of genes identified by 10%, but increased the overall sequence coverage by 50%. (2) Mapping of the ESTs to their predicted full-length transcripts indicated that all regions of the transcript were well represented regardless of transcript length or expression level. Furthermore, short, medium, and long transcripts were equally represented. (3) Over 16,000 of the ESTs that mapped to the genome were not represented in the existing dbEST database. In some cases, the ESTs provide the first experimental evidence for transcripts derived from predicted genes, and, for at least 60 locations in the genome, pyrosequencing identified likely protein-coding sequences that are not now annotated as genes. Together, the results indicate massively parallel pyrosequencing provides novel information helpful to improve the annotation of the Arabidopsis genome. Furthermore, the unbiased representation of transcripts will be particularly useful for gene discovery and gene expression analysis of nonmodel plants with less complete genomic information. EST sequence accession numbers in GenBank are EH 795234 through EH 995233 and EL 000001 through EL 341852.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust analysis of 5′-transcript ends (5′-RATE): a novel technique for transcriptome analysis and genome annotation

Complicated cloning procedures and the high cost of sequencing have inhibited the wide application of serial analysis of gene expression and massively parallel signature sequencing for genome-wide transcriptome profiling of complex genomes. Here we describe a new method called robust analysis of 5'-transcript ends (5'-RATE) for rapid and cost-effective isolation of long 5' transcript ends (appr...

متن کامل

De Novo Transcriptome of the Hemimetabolous German Cockroach (Blattella germanica)

BACKGROUND The German cockroach, Blattella germanica, is an important insect pest that transmits various pathogens mechanically and causes severe allergic diseases. This insect has long served as a model system for studies of insect biology, physiology and ecology. However, the lack of genome or transcriptome information heavily hinder our further understanding about the German cockroach in eve...

متن کامل

Robust analysis of 50-transcript ends (50-RATE): a novel technique for transcriptome analysis and genome annotation

Complicated cloning procedures and the high cost of sequencing have inhibited the wide application of serial analysis of gene expression and massively parallel signature sequencing for genome-wide transcriptome profiling of complex genomes. Here we describe a new method called robust analysis of 50-transcript ends (50-RATE) for rapid and costeffective isolation of long 50 transcript ends ( 80 b...

متن کامل

Analyzing the microRNA Transcriptome in Plants Using Deep Sequencing Data

MicroRNAs (miRNAs) are 20- to 24-nucleotide endogenous small RNA molecules emerging as an important class of sequence-specific, trans-acting regulators for modulating gene expression at the post-transcription level. There has been a surge of interest in the past decade in identifying miRNAs and profiling their expression pattern using various experimental approaches. In particular, ultra-deep s...

متن کامل

Transcriptome analysis in maritime pine using laser capture microdissection and 454 pyrosequencing.

Maritime pine (Pinus pinaster Aiton) is one of the most advanced conifer models for genomics research. Conifer genomes are extremely large and major advances have recently been made in the characterization of transcriptomes. The combination of laser capture microdissection (LCM) and next-generation sequencing is a powerful tool with which to resolve the entire transcriptome of specific cell typ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Plant physiology

دوره 144 1  شماره 

صفحات  -

تاریخ انتشار 2007